Goto

Collaborating Authors

 real-time conversation


OpenAI rolls out advanced Voice Mode and no, it won't sound like ScarJo

Engadget

OpenAI has started rolling out its advanced Voice Mode feature. Starting today, a small number of paying ChatGPT users will be able to have a tete-a-tete with the AI chatbot. All ChatGPT Plus members should receive access to the expanded toolset by the fall of this year. In an announcement on X, the company said this advanced version of its Voice Mode "offers more natural, real-time conversations, allows you to interrupt anytime, and senses and responds to your emotions." We're starting to roll out advanced Voice Mode to a small group of ChatGPT Plus users.


ESIHGNN: Event-State Interactions Infused Heterogeneous Graph Neural Network for Conversational Emotion Recognition

arXiv.org Artificial Intelligence

Conversational Emotion Recognition (CER) aims to predict the emotion expressed by an utterance (referred to as an ``event'') during a conversation. Existing graph-based methods mainly focus on event interactions to comprehend the conversational context, while overlooking the direct influence of the speaker's emotional state on the events. In addition, real-time modeling of the conversation is crucial for real-world applications but is rarely considered. Toward this end, we propose a novel graph-based approach, namely Event-State Interactions infused Heterogeneous Graph Neural Network (ESIHGNN), which incorporates the speaker's emotional state and constructs a heterogeneous event-state interaction graph to model the conversation. Specifically, a heterogeneous directed acyclic graph neural network is employed to dynamically update and enhance the representations of events and emotional states at each turn, thereby improving conversational coherence and consistency. Furthermore, to further improve the performance of CER, we enrich the graph's edges with external knowledge. Experimental results on four publicly available CER datasets show the superiority of our approach and the effectiveness of the introduced heterogeneous event-state interaction graph.


Real life Skynet? Controversial robot powered by OpenAI's ChatGPT can now have real-time conversations

Daily Mail - Science & tech

A new automated humanoid robot powered by OpenAI's ChatGPT resembles something akin to the AI Skynet from the sci-fi film Terminator While the new robot is not a killing machine, Figure 01 can perform basic autonomous tasks and carry out real-time conversations with humans - with the help of ChatGPT. The company, Figure AI, shared a demonstration video, showing how ChatGPT helps the two-legged machine visual objects, plan future actions and even reflect on its memory. Figure's cameras snap its surrounding and send them to a a large vision-language model trained by OpenAI, which than translates the images back to the robot. The clip showed a man asking the humanoid to put away dirty laundry, wash dishes and hand him something to eat - and the robot performed the tasks - but unlike ChatGPT, Figure is more hesitant when it comes to answering questions. Figure AI hopes that its first AI humanoid robot will prove capable at jobs too dangerous for human laborers and might alleviate worker shortages. 'Two weeks ago, we announced Figure OpenAI are joining forces to push the boundaries of robot learning,' Figure founder Brett Adcock wrote on X. 'Together we are developing next-generation AI models for our humanoid robots,' he added.


Introducing MIT Technology Review Roundtables, real-time conversations about what's next in tech

MIT Technology Review

There is little doubt that generative AI will affect the economy--but how, exactly, remains an open question. Despite fears that these AI tools will upend jobs and exacerbate wealth inequality, early evidence suggests the technology could help level the playing field--but only if we deploy it in the right ways. Likewise, the Inflation Reduction Act and the Chips Act both have huge implications for the economy, and for efforts to revive America's high-tech manufacturing base. Rotman and Honan will look at who stands to benefit from these transformative economic events, and what the risks are. Then, on September 12, our next edition of Roundtables will tackle another important question: How should we regulate AI? Charlotte Jee, news editor, and Melissa Heikkilä, senior reporter for AI, will discuss the state of AI regulation today and what to watch for in the months ahead.


Unleashing the Power of ChatGPT: A Comprehensive Guide to the Working and Architecture

#artificialintelligence

Chatbots are computer programs designed to simulate conversation with human users, especially over the Internet. They can be integrated into messaging platforms, mobile apps, and websites and are increasingly being used as customer service and support tools. Chatbots use natural language processing (NLP) algorithms to understand and respond to user input, allowing them to have conversation-like interactions with users. In this blog, I will explain about how the revolutionized ChatGPT built by OpenAI works and how the internal Architecture is built to support this huge data-driven application. The modern natural language processing (NLP) model ChatGPT, created by OpenAI, is intended to produce text that sounds like human speech during discussions.